Far-end data migration device and method based on FPGA cloud platform

ABSTRACT

A far-end data migration device and method based on a FPGA cloud platform. The device includes a server, a switch, and a plurality of FPGA acceleration cards. The server transmits data to be accelerated to the FPGA acceleration cards by means of the switch. The FPGA acceleration cards are configured to perform a primary and/or secondary acceleration on the data, and are configured to migrate the accelerated data. The method includes: transmitting data to be accelerated to a FPGA acceleration card from a server by means of a switch; performing, by the FPGA acceleration card, a primary and/or secondary acceleration on the data to be accelerated; and migrating, by the FPGA acceleration card, the accelerated data.

This application claims priority to Chinese Patent Application No.202010031268.7, filed on Jan. 13, 2020, in China National IntellectualProperty Administration and entitled “Far-End Data Migration Device andMethod Based on FPGA Cloud Platform”, the contents of which are herebyincorporated by reference in its entirety.

FIELD

The present disclosure relates to the technical field of FieldProgrammable Gate Array (FPGA)-based data migration applications, andparticularly to a far-end data migration device and method based on aFPGA cloud platform.

BACKGROUND

Cloud computing is an Internet-based computing mode. In this mode,shared hardware and software resources and information may be providedfor computers and other devices as needed. Data grows in a cloud byabout 30% per year, and meanwhile, rapid development of ArtificialIntelligence (AI) also makes requirements for high-performance datacomputing. As a result, a conventional Central Processing Unit (CPU) isunable to solve problems about computing performance. A FPGA isconfigured for computing acceleration in a data center by virtue of itsadvantages of high performance, low latency, flexible extensibility, lowpower consumption, etc. Currently, Microsoft, Amazon, Baidu, Tencent,Alibaba, and other data centers have all launched FPGA cloud platformsto implement computing acceleration by taking FPGAs as sharableresources in clouds. Multiple FPGA accelerator units may form acomputing resource pool through a network, thereby implementingdistributed data acceleration. The key to the implementation of adistributed FPGA cloud platform is how to implement data migration indifferent FPGA accelerator units and improve the data migrationefficiency.

FIG. 1 is a schematic diagram of a network topology of an existing FPGAcloud platform. FPGA boards are connected to a switch through MediaAccess Control (MAC) network interfaces to form a FPGA resource pool.The FPGA board may be in the form of a Peripheral Component InterconnectExpress (PCIE) acceleration card in a server, or multiple independentFPGA boards in a Just a Bunch Of FPGAs (JBOF, a FPGA pool including FPGAacceleration cards only). The FPGA cloud platform is generallyconfigured to accelerate an algorithm with a quite large amount of datacomputation, such as an AI algorithm, picture processing, and genesequencing. Multiple FPGA boards are needed to accelerate an algorithmcooperatively by data interaction between the multiple FPGA boards.

A Remote Direct Memory Access (RDMA) technology is a modernhigh-performance network communication technology based on hardwareacceleration. RDMA over Converged Ethernet (RoCE), a technology commonlyused for FPGA clouds currently, defines how to implement RDMA over theEthernet. RoCE directly transmits data from a memory of a computer toanother computer without interventions of operating systems of bothsides. FIG. 2 is a schematic diagram of functions of an existing RDMAtechnology. When a server A transmits data to a server B, an applicationof the server A executes an RDMA write request, and without theparticipation of a kernel memory, the RDMA write request is sent fromthe application running in a user space to a cache Memory of a FPGAboard A with a RDMA function. The FPGA board A reads the data in thecache Memory, and transmits the data to a cache Memory of a FPGA board Bthrough a network. Then, the FPGA board B directly writes the data to anapplication cache of the server B.

A RDMA protocol standard is set by the InfiniBand Trade Association(IBTA, the setter of the Infiniband standard) to implement datatransmission between endpoints. A FPGA needs to follow the RDMA protocolstandard to realize a RDMA function, which makes it relatively complexto realize the function and occupies many FPGA resources. In addition,the RDMA standard defines a protocol standard for data transmissionbetween two hosts, but does not define any protocol standard for datatransmission between FPGA boards in a JBOF topology, and it is necessaryto seek for a method suitable for data migration between FPGA boards ina JBOF topology.

SUMMARY

Embodiments of the present disclosure provide a far-end data migrationdevice and method based on a FPGA cloud platform, so as to solve theproblem of data acceleration and migration between FPGA boards in a JBOFtopology under a FPGA cloud platform.

The embodiments of the present disclosure disclose the followingtechnical solutions.

A first aspect of the present disclosure provides a far-end datamigration device based on a FPGA cloud platform, including a server, aswitch, and FPGA acceleration cards. The device includes a plurality ofFPGA acceleration cards. The server transmits data to be accelerated tothe plurality of FPGA acceleration cards by means of the switch. Theplurality of FPGA acceleration cards are configured to perform a primaryand/or secondary acceleration on the data, and are configured to migratethe accelerated data.

Further, the FPGA acceleration card includes a SHELL and a FPGAAccelerator Unit (FAU). The SHELL is configured as an interfaceconnection between the FPGA acceleration card and the switch. The SHELLis configured to migrate data on the FPGA acceleration card. The FAU isconfigured to perform the primary and/or secondary acceleration on thedata on the FPGA acceleration card.

Further, the SHELL includes an iRDMA, a Memory, a PCIE, and a MAC. TheMemory is connected with the iRDMA. The iRDMA is connected with the PCIEand the MAC. In response to the Memory on the FPGA acceleration cardbeing accelerated by the FAU, the iRDMA is configured to implement datamigration between the Memory and FAU on the FPGA acceleration card. Inresponse to the data being migrated on the plurality of FPGAacceleration cards, the iRDMA implements data migration between Memorieson the plurality of FPGA acceleration cards through MAC interfaces.

Further, an acceleration algorithm of the FAU includes LZ77 and Huffman.The LZ77 acceleration algorithm performs a first-stage compression onthe data on the FPGA acceleration card to implement a primary dataacceleration. The Huffman acceleration algorithm performs a second-stagecompression on primarily accelerated data on the FPGA acceleration cardto implement a secondary data acceleration.

Further, the iRDMA includes a bridge module Bridge, a message processingmodule pkt_pro, and a parsing module fau_parse. The message processingmodule pkt_pro parses and encapsulates a data migration read/writeinstruction message received by a PCIE interface or the MAC interface.The parsing module fau_parse is configured to parse a data migrationread instruction message initiated by the FAU. The bridge module Bridgeis configured to convert migration instructions parsed by the bridgemodule Bridge and the message processing module pkt_pro into timing ofreading/writing the Memory interface.

A second aspect of the present disclosure provides a far-end datamigration method based on a FPGA cloud platform, including:

-   -   transmitting data to be accelerated to a FPGA acceleration card        from a server by means of a switch;    -   performing, by the FPGA acceleration card, a primary and/or        secondary acceleration on the data to be accelerated; and    -   migrating, by the FPGA acceleration card, the accelerated data.

Further, the step of performing, by the FPGA acceleration card, theprimary and/or secondary acceleration on the data to be acceleratedspecifically includes:

-   -   performing, by a FAU of the FPGA acceleration card, primary        and/or secondary acceleration on the data to be accelerated.

Further, the step of migrating, by the FPGA acceleration card, theaccelerated data specifically includes:

-   -   in response to data in a Memory on the FPGA acceleration card        being accelerated by a FAU, implementing, by an iRDMA of the        FPGA acceleration card, data migration between the Memory and        FAU on the FPGA acceleration card, and in response to the        accelerated data being migrated on a plurality of FPGA        acceleration cards, implementing, by the iRDMA, data migration        between Memories on the plurality of FPGA acceleration cards        through MAC interfaces.

Further, an acceleration algorithm of the FAU includes LZ77 and Huffman.The LZ77 acceleration algorithm performs a first-stage compression onthe data on the FPGA acceleration card to implement the primary dataacceleration. The Huffman acceleration algorithm performs a second-stagecompression on primarily accelerated data on the FPGA acceleration cardto implement the secondary data acceleration.

Further, the iRDMA includes a bridge module Bridge, a message processingmodule pkt_pro, and a parsing module fau_parse. The message processingmodule pkt_pro parses and encapsulates a data migration read/writeinstruction message received by PCIE interface or the MAC interface. Theparsing module fau_parse is configured to parse a data migration readinstruction message initiated by the FAU. The bridge module Bridge isconfigured to convert migration instructions parsed by the bridge moduleBridge and the message processing module pkt_pro into timing ofreading/writing the Memory interface.

The effects provided in SUMMARY are not all effects of the presentdisclosure but only effects of the embodiments. One of the abovetechnical solutions has the following advantages or beneficial effects.

According to the far-end data migration device and method based on aFPGA cloud platform in the present disclosure, data migration between aplurality of FPGA acceleration cards is completed by read/writeinstructions defined by the iRDMA on the FPGA acceleration cards, theFAUs, and the MAC interfaces. According to the present disclosure, aRoCE protocol is simplified, the present disclosure may be applied to aJBOF topology, the transmission efficiency is high, and thecompetitiveness of a cloud platform product of an enterprise isimproved.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to describe the technical solution in the embodiments of thepresent disclosure or the prior art more clearly, the drawings requiredto be used in descriptions about the embodiments or the prior art willbe introduced briefly below. Apparently, those ordinarily skilled in theart may further obtain other drawings according to these drawingswithout creative work.

FIG. 1 is a schematic diagram of a network topology of an existing FPGAcloud platform according to the present disclosure;

FIG. 2 is a schematic diagram of functions of an existing RDMAtechnology according to the present disclosure;

FIG. 3 is a structural block diagram of a device according to thepresent disclosure;

FIG. 4 is a module block diagram of an iRDMA according to an embodimentof the present disclosure; and

FIG. 5 is a flowchart of a method according to the present disclosure.

DETAILED DESCRIPTION

In order to describe the technical features of the present solutionclearly, the present disclosure will be described below in detail withspecific embodiments in combination with the drawings. The followingdisclosure provides many different embodiments or examples to implementdifferent structures of the present disclosure. In order to simplify thepresent disclosure, components and settings in specific examples aredescribed below. In addition, in the present disclosure, referencenumerals and/or letters may be reused in different examples. Such reuseis for brevity and clarity and does not indicate a relationship betweeneach embodiment and/or setting that is discussed. It is to be noted thatthe components shown in the drawings are not necessarily drawn to scale.Descriptions about known components and processing technologies andprocesses are omitted in the present disclosure so as to avoidunnecessary limitations on the present disclosure.

FIG. 3 is a structural block diagram of a device according to thepresent disclosure. The device includes a server, a switch, and aplurality of FPGA acceleration cards. The server transmits data to beaccelerated to the FPGA acceleration card by means of the switch. TheFPGA acceleration cards are configured to perform a primary and/orsecondary acceleration on the data, and are configured to migrate theaccelerated data.

The FPGA acceleration card includes a SHELL (the SHELL is a FPGA shellunit, a static part in a FPGA project that is unmodifiable by a user)and a FAU (an application acceleration module that is dynamicallyreconfigurable). The SHELL is configured to migrate data on the FPGAacceleration card. The FAU is configured to perform the primary and/orsecondary acceleration on the data on the FPGA acceleration card.

The SHELL realizes an interface function of a FPGA, including a PCIEDirect Memory Access (DMA) interface, a MAC interface, a Memoryinterface, etc. The SHELL is a static part of the FPGA that isunmodifiable by a user. The FAU is an accelerator unit reconfigurable bythe user. Different users may load different acceleration applications,and different boards may also load different applications. For example,a FAU of the board A may use a Convolutional Neural Network (CNN)acceleration algorithm, while a FAU of the board B uses a Deep NeuralNetwork (DNN) acceleration algorithm, but static SHELL parts of the twoboards are consistent.

The SHELL includes an iRDMA (customized RDMA in the present disclosurefor a FPGA cloud platform), a Memory, PCIE, and MAC (in layer 2 in anetwork). The Memory is connected with the iRDMA. The iRDMA is connectedwith the PCIE and the MAC. When the server migrates the data to the FPGAacceleration card, the iRDMA implements data migration between a CPUMemory on the server and the Memory of the FPGA acceleration card. Whenthe data in the Memory on the FPGA acceleration card is accelerated bythe FAU, the iRDMA is configured to implement data migration between theMemory and FAU on the FPGA acceleration card. When the accelerated datais migrated on the plurality of FPGA acceleration cards, the iRDMAimplements data migration between the Memories on the plurality of FPGAacceleration cards through the MAC interfaces.

An acceleration algorithm of the FAU includes LZ77 and Huffman. The LZ77acceleration algorithm performs a first-stage compression on the data onthe FPGA acceleration card to implement the primary data acceleration.The Huffman acceleration algorithm performs a second-stage compressionon the primarily accelerated data on the FPGA acceleration card toimplement the secondary data acceleration.

The acceleration algorithm combines two algorithms, i.e., a dictionarymode LZ77 algorithm and Huffman for redundancy statistics, therebyachieving a high compression rate. The Huffman algorithm depends on theLZ77 algorithm. The two algorithms may be used for acceleration of twoFPGA acceleration cards respectively.

FIG. 4 is a modular block diagram of iRDMA according to an embodiment ofthe present disclosure. The iRDMA includes a bridge module Bridge, amessage processing module pkt_pro, and a parsing module fau_parse. Themessage processing module pkt_pro parses and encapsulates a datamigration read/write instruction message received by the PCIE interfaceor the MAC interface. The parsing module fau_parse is configured toparse a data migration read instruction message initiated by the FAU.The bridge module Bridge is configured to convert migration instructionsparsed by the bridge module Bridge and the message processing modulepkt_pro into timing of reading/writing the Memory interface.

The message processing module pkt_proc receives an iRDMA read/writeinstruction message input from the PCIE interface or the MAC interface,and sends a read/write instruction obtained by a message parsing processto the bridge module Bridge. The bridge module Bridge converts theread/write instruction into timing of reading/writing the Memoryinterface to complete reading or writing the Memory.

In case of an iRDMA read instruction, the bridge module Bridge readsdata from the Memory, and then sends the data to the message processingmodule pkt_proc. The message processing module pkt_proc completes aprocess such as message encapsulation, and then sends a message to thePCIE interface or the MAC interface.

The FAU completes a data acceleration process, and then initiates aniRDMA_rd instruction. The read instruction is sent to the bridge moduleBridge after processing of the message processing module pkt_proc. Thebridge module Bridge converts the read instruction into Memory readtiming. The data read from the Memory is processed by the bridge moduleBridge, and then is sent to the message processing module pkt_proc. Themessage processing module pkt_proc performs message encapsulation, andthen sends a message to the PCIE interface or the MAC interface foroutput.

According to the method of the present disclosure, a customized methodfor completing data migration simply and efficiently under a FPGA cloudplatform of a JBOF topology is provided, a FPGA parses a customizediRDMA read/write instruction message to complete data migrationautomatically, and a FAU may also trigger an instruction for datamigration.

FIG. 5 is a flowchart of a method according to the present disclosure.The method includes the following steps:

-   -   data to be accelerated is transmitted to a FPGA acceleration        card from a server by means of a switch;    -   the FPGA acceleration card performs the primary and/or secondary        acceleration on the data to be accelerated; and    -   the FPGA acceleration card migrates the accelerated data.

The step that data to be accelerated is transmitted to a FPGAacceleration card from a server by means of a switch specificallyincludes that: the data to be accelerated is transmitted to the switchfrom the server, and then is transmitted to a PCIE interface of the FPGAacceleration card from the switch.

The step that the FPGA acceleration card performs the primary and/orsecondary acceleration on the data to be accelerated specificallyincludes that: a FAU of the FPGA acceleration card performs primaryand/or secondary acceleration on the data to be accelerated.

The step that the FPGA acceleration card migrates the accelerated dataspecifically includes that:

-   -   when data in a Memory on the FPGA acceleration card is        accelerated by a FAU, iRDMA of the FPGA acceleration card        implements data migration between the Memory and the FAU on the        FPGA acceleration card, and when the accelerated data is        migrated on a plurality of FPGA acceleration cards, the iRDMA        implements data migration between the Memories on the plurality        of FPGA acceleration cards through the MAC interfaces.

An acceleration algorithm of the FAU includes LZ77 and Huffman. The LZ77acceleration algorithm performs the first-stage compression on the dataon the FPGA acceleration card to implement the primary dataacceleration. The Huffman acceleration algorithm performs thesecond-stage compression on the primarily accelerated data on the FPGAacceleration card to implement the secondary data acceleration.

The iRDMA includes a bridge module Bridge, a message processing modulepkt_pro, and a parsing module fau_parse. The message processing modulepkt_pro parses and encapsulates a data migration read/write instructionmessage received by a PCIE interface or the MAC interface. The parsingmodule fau_parse is configured to parse a data migration readinstruction message initiated by the FAU. The bridge module Bridge isconfigured to convert migration instructions parsed by the bridge moduleBridge and the message processing module pkt_pro into timing ofreading/writing the Memory interface.

A detailed working process of the method of the present disclosure is asfollows.

Data to be accelerated is transmitted to a PCIE interface of a firstFPGA acceleration card from a server by means of a switch.

The first iRDMA of the first FPGA acceleration card receives a writeinstruction, and stores the data to be accelerated on the PCIE interfacein a first Memory of the first FPGA acceleration card. The first iRDMAreceives a read instruction, and reads the data to be accelerated fromthe first Memory to a first FAU of the first FPGA acceleration card. Thefirst FAU performs first-stage acceleration on the data to beaccelerated by means of the LZ77 acceleration algorithm to implementprimary data acceleration.

The first FAU sends an iRDMA read instruction after completing theprimary data acceleration. The first iRDMA of the first FPGAacceleration card receives the read instruction, and reads the primarilyaccelerated data from the first Memory. The first iRDMA of the firstFPGA acceleration card encapsulates the primarily accelerated data intoan iRDMA write instruction, and transmits the iRDMA write instruction toa second FPGA acceleration card through a first MAC interface of thefirst FPGA acceleration card.

The second FPGA acceleration card receives the iRDMA write instruction,and stores the primarily accelerated data in a second Memory of thesecond FPGA acceleration card. The second iRDMA of the second FPGAacceleration card receives a read instruction, and reads the primarilyaccelerated data from the second Memory to a second FAU of the secondFPGA acceleration card. The second FAU performs the second-stageacceleration on the primarily accelerated data by means of the Huffmanacceleration algorithm to implement the secondary data acceleration.

In the present disclosure, a customized iRDMA data migration method isprovided for a JBOF network topology, thereby implementing datamigration based on a FPGA cloud platform simply and efficiently. TheiRDMA module uses 15K Look-Up-Table (LUT) resources (important FPGAresources) of a FPGA, occupying about 1% of LUT resources of a VU37PFPGA, while RoCE needs 40K LUT resources, occupying about 3% of the LUTresources of the VU37P FPGA (a FPGA with many Xilinx FPGA resources),thus iRDMA improves the data migration efficiency greatly.

Hereinbefore, it is only the preferred embodiment of the presentdisclosure. Those ordinarily skilled in the art may further make aplurality of improvements and embellishments without departing from theprinciple of the present disclosure, and these improvements andembellishments shall also be regarded as falling within the scope ofprotection of the present disclosure.

What is claimed is:
 1. A far-end data migration device based on a FieldProgrammable Gate Array (FPGA) cloud platform, comprising a server, aswitch, and a plurality of FPGA acceleration cards, wherein the servertransmits data to be accelerated to the plurality of FPGA accelerationcards by means of the switch; and the plurality of FPGA accelerationcards are configured to perform at least one of a primary accelerationor a secondary acceleration on the data to yield accelerated data, andare configured to migrate the accelerated data; wherein each of theplurality of FPGA acceleration cards comprises a SHELL and a FPGAAccelerator Unit (FAU), wherein the SHELL is configured as an interfaceconnection between the FPGA acceleration card and the switch, and isconfigured to migrate the data on the FPGA acceleration card; and theFAU is configured to perform the at least one of the primaryacceleration or the secondary acceleration on the data on the FPGAacceleration cart; the SHELL comprises an iRDMA, a Memory, a PeripheralComponent Interconnect Express (PCIE), and a Media Access Control (MAC),wherein the Memory is connected with the iRDMA; the iRDMA is connectedwith the PCIE and the MAC; in response to data in the Memory on the FPGAacceleration card being accelerated by the FAU, the iRDMA is configuredto implement data migration between the Memory and the FAU on the FPGAacceleration card; and in response to the accelerated data beingmigrated on the plurality of FPGA acceleration cards, the iRDMAimplements data migration between the Memories on the plurality of FPGAacceleration cards through MAC interfaces.
 2. The far-end data migrationdevice based on the FPGA cloud platform according to claim 1, whereinacceleration algorithms of the FAU comprise LZ77 and Huffman, whereinthe LZ77 acceleration algorithm performs a first-stage compression onthe data on the FPGA acceleration card to implement the primaryacceleration to yield primarily accelerated data; and the Huffmanacceleration algorithm performs a second-stage compression on theprimarily accelerated data on the FPGA acceleration card to implementthe secondary acceleration.
 3. The far-end data migration device basedon the FPGA cloud platform according to claim 2, wherein the Huffmanacceleration algorithm depends on the LZ77 acceleration algorithm. 4.The far-end data migration device based on the FPGA cloud platformaccording to claim 1, wherein the iRDMA comprises a bridge moduleBridge, a message processing module pkt_pro, and a parsing modulefau_parse, wherein the message processing module pkt_pro parses andencapsulates a data migration read/write instruction message received bya PCIE interface or the MAC interface; the parsing module fau_parse isconfigured to parse a data migration read instruction message initiatedby the FAU; and the bridge module Bridge is configured to convertmigration instructions parsed by the bridge module Bridge and themessage processing module pkt_pro into timing of reading/writing aMemory interface.
 5. The far-end data migration device based on the FPGAcloud platform according to claim 4, wherein the message processingmodule pkt_pro receives an iRDMA read/write instruction message inputfrom the PCIE interface or the MAC interface, and sends a read/writeinstruction obtained by a message parsing process to the bridge moduleBridge.
 6. The far-end data migration device based on the FPGA cloudplatform according to claim 5, wherein under a circumstance of the iRDMAread/write instruction message being an iRDMA read instruction, thebridge module Bridge reads data from the Memory, and then sends the datato the message processing module pkt_pro.
 7. The far-end data migrationdevice based on the FPGA cloud platform according to claim 6, whereinthe FAU completes a data acceleration process, and then initiates aniRDMA_rd instruction.
 8. The far-end data migration device based on theFPGA cloud platform according to claim 1, wherein the far-end datamigration device is provided for a Just a Bunch Of FPGAs (JBOF) networktopology.
 9. A far-end data migration method based on a FieldProgrammable Gate Array (FPGA) cloud platform, wherein the methodcomprises following steps: transmitting data to be accelerated to a FPGAacceleration card from a server by means of a switch; performing, by theFPGA acceleration card, at least one of a primary acceleration or asecondary acceleration on the data to be accelerated to yieldaccelerated data; and migrating, by the FPGA acceleration card, theaccelerated data, wherein the step of migrating, by the FPGAacceleration card, the accelerated data comprises: in response to datain a Memory on the FPGA acceleration card being accelerated by a FPGAAccelerator Unit (FAU), implementing, by an iRDMA of the FPGAacceleration card, data migration between the Memory and the FAU on theFPGA acceleration card, and in response to the accelerated data beingmigrated on a plurality of FPGA acceleration cards, implementing, by theiRDMA, data migration between Memories on the plurality of FPGAacceleration cards through Media Access Control (MAC) interfaces. 10.The far-end data migration method based on the FPGA cloud platformaccording to claim 9, wherein the step of performing, by the FPGAacceleration card, the at least one of the primary acceleration or thesecondary acceleration on the data to be accelerated comprises:performing, by the FAU of the FPGA acceleration card, the at least oneof the primary acceleration or the secondary acceleration on the data tobe accelerated.
 11. The far-end data migration method based on the FPGAcloud platform according to claim 10, wherein acceleration algorithms ofthe FAU comprise LZ77 and Huffman, wherein the LZ77 accelerationalgorithm performs a first-stage compression on the data on the FPGAacceleration card to implement the primary acceleration to yieldprimarily accelerated data; and the Huffman acceleration algorithmperforms a second-stage compression on the primarily accelerated data onthe FPGA acceleration card to implement the secondary acceleration. 12.The far-end data migration method based on the FPGA cloud platformaccording to claim 11, wherein the Huffman acceleration algorithmdepends on the LZ77 acceleration algorithm.
 13. The far-end datamigration method based on the FPGA cloud platform according to claim 11,wherein the far-end data migration method is provided for a Just a BunchOf FPGAs (JBOF) network topology.
 14. The far-end data migration methodbased on the FPGA cloud platform according to claim 9, wherein the iRDMAcomprises a bridge module Bridge, a message processing module pkt_pro,and a parsing module fau_parse, wherein the message processing modulepkt_pro parses and encapsulates a data migration read/write instructionmessage received by a Peripheral Component Interconnect Express (PCIE)interface or MAC interface of the FPGA acceleration card; the parsingmodule fau_parse is configured to parse a data migration readinstruction message initiated by the FAU; and the bridge module Bridgeis configured to convert migration instructions parsed by the bridgemodule Bridge and the message processing module pkt_pro into timing ofreading/writing a Memory interface.
 15. The far-end data migrationmethod based on the FPGA cloud platform according to claim 14, whereinthe message processing module pkt_pro receives an iRDMA read/writeinstruction message input from the PCIE interface or the MAC interface,and sends a read/write instruction obtained by a message parsing processto the bridge module Bridge.
 16. The far-end data migration method basedon the FPGA cloud platform according to claim 15, wherein under acircumstance of the iRDMA read/write instruction message being an iRDMAread instruction, the bridge module Bridge reads data from the Memory,and then sends the data to the message processing module pkt_pro. 17.The far-end data migration method based on the FPGA cloud platformaccording to claim 16, wherein the FAU completes a data accelerationprocess, and then initiates an iRDMA_rd instruction.